Project: Investigate a Dataset (TMDb Movies Dataset)

Table of Contents

Introduction

In this section of the report, we introduce the dataset.

This dataset was generated from The Movie Database API. This product uses the TMDb API but is not endorsed or certified by TMDb.

What can we say about the success of a movie before it is released? Are there certain companies (Pixar?) that have found a consistent formula? Given that major films costing over $100 million to produce can still flop, this question is more important than ever to the industry. Film aficionados might have different interests. Can we predict which films will be highly rated, whether or not they are a commercial success?

Data Wrangling

Tip: In this section of the report, you will load in the data, check for cleanliness, and then trim and clean your dataset for analysis. Make sure that you document your steps carefully and justify your cleaning decisions.

General Properties
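As a rough illustration of the load-and-inspect step, the sketch below reads a small CSV and checks its shape, missing values, duplicate rows, and zero budgets. The inline CSV text and its column names (`id`, `original_title`, `budget`, `revenue`, `runtime`) are hypothetical stand-ins; the real file and its full column set are not shown in this report.

```python
import io

import pandas as pd

# Hypothetical CSV text standing in for the real TMDb file (assumption).
csv_text = """id,original_title,budget,revenue,runtime
1,Film A,1000,5000,100
2,Film B,0,0,90
2,Film B,0,0,90
3,Film C,2000,,120
"""

df = pd.read_csv(io.StringIO(csv_text))

# Basic properties: size, missing values, duplicate rows.
n_rows, n_cols = df.shape
missing_per_column = df.isnull().sum()
n_duplicates = int(df.duplicated().sum())

# Zeros in budget often mean "unknown" rather than a true value of 0.
zero_budget = int((df["budget"] == 0).sum())

print(f"{n_rows} rows, {n_cols} columns")
print(missing_per_column)
print(f"duplicate rows: {n_duplicates}, zero budgets: {zero_budget}")
```

With the real data, `pd.read_csv` would be pointed at the downloaded file instead of the in-memory text.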

Data Cleaning

The cleaning steps performed in the second part of this section are listed below.

1. Rearrange columns

2. Remove columns with unnecessary values

3. Convert the release date column to a standard date format

4. Change the format of the budget and revenue columns

5. Replace "0" values with the mean

Mean value before replacing "0" values

Mean value after replacing "0" values with the mean

7. Make a new DataFrame with a profit column and fewer columns

8. Explore the data with the keyword 'woman director' and make a new DataFrame

9. Make a new DataFrame without the 'woman director' keyword
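The cleaning steps above can be sketched roughly as follows. This is a minimal illustration on a hypothetical toy table, not the notebook's original code. The column names are assumptions, casting to float is only one possible reading of step 4, and filling zeros with the mean of the non-zero values is only one possible reading of step 5.

```python
import numpy as np
import pandas as pd

# Hypothetical toy rows; column names are assumptions, not the real schema.
df = pd.DataFrame({
    "homepage": [None, None, "http://example.com", None],
    "keywords": ["woman director|drama", "heist", None, "woman director"],
    "budget": [150_000_000, 0, 50_000_000, 0],
    "revenue": [500_000_000, 30_000_000, 0, 10_000_000],
    "original_title": ["A", "B", "C", "D"],
    "release_date": ["6/9/2015", "5/13/2015", "3/18/2004", "12/15/1999"],
    "runtime": [95, 130, 110, 88],
})

# Step 1: rearrange columns into a more readable order.
df = df[["original_title", "release_date", "runtime",
         "budget", "revenue", "keywords", "homepage"]]

# Step 2: drop a column that is not needed for the analysis.
df = df.drop(columns=["homepage"])

# Step 3: parse the release date strings into datetime values.
df["release_date"] = pd.to_datetime(df["release_date"], format="%m/%d/%Y")

# Step 4: store budget and revenue as floats.
df = df.astype({"budget": "float64", "revenue": "float64"})

# Step 5: treat 0 as missing and fill it with the mean of the non-zero values.
for col in ["budget", "revenue"]:
    non_zero_mean = df[col].replace(0, np.nan).mean()
    df[col] = df[col].replace(0, non_zero_mean)

# Step 7: smaller frame with a derived profit column.
profit_df = df[["original_title", "runtime", "budget", "revenue"]].copy()
profit_df["profit"] = profit_df["revenue"] - profit_df["budget"]

# Steps 8 and 9: split on the 'woman director' keyword.
is_woman_director = df["keywords"].str.contains("woman director", na=False)
woman_df = df[is_woman_director]
other_df = df[~is_woman_director]
```

Comparing `df[col].mean()` before and after step 5 shows how much the fill shifts the column mean, which is what the two "mean value" checks in the list report.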

Exploratory Data Analysis

Tip: Now that you've trimmed and cleaned your data, you're ready to move on to exploration. Compute statistics and create visualizations with the goal of addressing the research questions that you posed in the Introduction section. It is recommended that you be systematic with your approach. Look at one variable at a time, and then follow it up by looking at relationships between variables.

Explore what the histogram of the data looks like

Explore Men_Director

Explore Woman_Director

1. Films directed by women start to become noticeable in the data after 2000, over the following 15 years

2. Women tend to direct shorter films, and only a few direct long films

3. Men tend to direct films with runtimes between 120 and 180 min
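One way to check observations like these numerically is to compare the two groups directly. The sketch below uses a hypothetical toy table; the column names `release_year`, `runtime`, and `keywords`, and the group labels, are assumptions made for illustration.

```python
import pandas as pd

# Hypothetical toy rows (assumption); not taken from the real dataset.
films = pd.DataFrame({
    "release_year": [1995, 2003, 2008, 2012, 2014, 2015],
    "runtime": [150, 95, 130, 88, 170, 102],
    "keywords": ["heist", "woman director", "war",
                 "woman director|drama", None, "woman director"],
})

# Label each film by whether its keywords mention 'woman director'.
mask = films["keywords"].str.contains("woman director", na=False)
films["group"] = mask.map({True: "woman_director", False: "other"})

# Typical runtime per group.
median_runtime = films.groupby("group")["runtime"].median()

# Number of films per release year for the 'woman director' group.
films_per_year = films[mask].groupby("release_year").size()

print(median_runtime)
print(films_per_year)
```

The per-year counts show when the group starts to appear, and the medians give a single-number comparison of runtimes to read alongside the histograms.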

Research Question 1

What is the relationship between runtime and profit?

Most films with a runtime between 100 and 200 min make a profit between 0 and 0.5 million $.
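A simple numeric companion to this question is a correlation coefficient plus a summary of profit per runtime band. The sketch below is a minimal example on hypothetical values; the column names `runtime` and `profit` and the band edges are assumptions.

```python
import pandas as pd

# Hypothetical toy values (assumption); not taken from the real dataset.
df = pd.DataFrame({
    "runtime": [85, 95, 110, 125, 150, 190],
    "profit": [1.0e6, -2.0e6, 4.0e6, 6.0e6, 3.0e6, 8.0e6],
})

# Pearson correlation between runtime and profit.
runtime_profit_corr = df["runtime"].corr(df["profit"])

# Group runtimes into bands: (0, 100], (100, 200], (200, 300].
bins = [0, 100, 200, 300]
labels = ["<=100", "100-200", "200-300"]
df["runtime_band"] = pd.cut(df["runtime"], bins=bins, labels=labels)

# Median profit per band; observed=True skips bands with no films.
median_profit_by_band = (
    df.groupby("runtime_band", observed=True)["profit"].median()
)

print(f"correlation: {runtime_profit_corr:.2f}")
print(median_profit_by_band)
```

The median is used here rather than the mean because a few very profitable films can dominate an average.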

Research Question 2

Which movies have the most and least profit?
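A minimal sketch of how this question could be answered, assuming profit is defined as revenue minus budget. The toy rows and the column names `original_title`, `budget`, and `revenue` are hypothetical.

```python
import pandas as pd

# Hypothetical toy rows (assumption); not taken from the real dataset.
df = pd.DataFrame({
    "original_title": ["Film A", "Film B", "Film C"],
    "budget": [100.0, 50.0, 80.0],
    "revenue": [400.0, 20.0, 90.0],
})

# Profit assumed to be revenue minus budget.
df["profit"] = df["revenue"] - df["budget"]

# idxmax / idxmin return the row labels of the largest and smallest profit.
most_profitable = df.loc[df["profit"].idxmax()]
least_profitable = df.loc[df["profit"].idxmin()]

print("most profit:", most_profitable["original_title"], most_profitable["profit"])
print("least profit:", least_profitable["original_title"], least_profitable["profit"])
```

If budgets or revenues were filled in with a mean value during cleaning, the extremes found this way may reflect that fill rather than reported figures, so the rows are worth checking by hand.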